# Low VRAM Inference

Qwen2.5 VL 7B Instruct GPTQ Int3
Apache-2.0
This is an unofficial GPTQ-Int3 quantized version based on the Qwen2.5-VL-7B-Instruct model, suitable for multimodal image-text-to-text tasks.
Image-to-Text Transformers Supports Multiple Languages
Q
hfl
577
1
Qwen2.5 VL 3B Instruct GPTQ Int4
Apache-2.0
This is the GPTQ-Int4 quantized version of the Qwen2.5-VL-3B-Instruct model, suitable for multimodal tasks involving image-to-text and text-to-text, supporting both Chinese and English.
Image-to-Text Transformers Supports Multiple Languages
Q
hfl
1,312
2
Smolvlm2 500M Video Instruct
Apache-2.0
A lightweight multimodal model designed for analyzing video content, capable of processing video, image, and text inputs to generate text outputs.
Image-to-Text Transformers English
S
HuggingFaceTB
17.89k
56
Smolvlm2 256M Video Instruct
Apache-2.0
SmolVLM2-256M-Video is a lightweight multimodal model specifically designed for analyzing video content, capable of processing video, image, and text inputs to generate text outputs.
Image-to-Text Transformers English
S
HuggingFaceTB
22.16k
53
Molmo 7B D 0924 NF4
Apache-2.0
The 4Bit quantized version of Molmo-7B-D-0924, which reduces VRAM usage through the NF4 quantization strategy and is suitable for environments with limited VRAM.
Image-to-Text Transformers
M
Scoolar
1,259
1
Meta Llama 3.1 8B Instruct AWQ INT4
INT4 quantized version of Llama 3.1 8B Instruct, quantized using AutoAWQ tool, suitable for multilingual dialogue scenarios.
Large Language Model Transformers Supports Multiple Languages
M
hugging-quants
348.23k
67
Rwkv 4 169m Pile
RWKV-4 is a large language model combining the strengths of RNN and Transformer, featuring high performance, fast inference, and efficient training
Large Language Model Transformers
R
RWKV
5,698
8
Moss Moon 003 Sft
MOSS is an open-source conversational language model supporting plugin enhancement, with 16 billion parameters, capable of Chinese-English dialogue and tool calling.
Large Language Model Transformers Supports Multiple Languages
M
fnlp
98
127
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase